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ABSTRACT 



A Web browser running on a client machine typically 
includes an address field in which a Uniform Resource 
Locator (URL) may be entered. The URL identifies a 
particular server (or file) located at a target Web site. If a 
given URL is entered incorrectly at the Web client, a fuzzy 
URL detection scheme automatically performs a fuzzy 
search that returns a list of URLs that most closely match 
what was originally entered into the browser address field. 
The user can then select the correct URL from the list and 
launch the browser to the desired site, or to a directory or file 
within that site. If the fuzzy search does not reveal a match, 
the browser may contact a server dedicated to performing a 
broader fuzzy search. In another alternative, the browser 
contacts a Web server and the fuzzy search is implemented 
at the Web server in order to return a particular file. 

24 Claims, 5 Drawing Sheets 
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.C WWW.USAT0DAY.COM ♦ 1 , WWW.IBM.COM ♦ 1, WWW.CNN.COM ♦ 2, WWW.YAH00.COM * 1, 

.1 WWW.IBM.COM* 1, 

.M WWW.IBM.COM* 1, 

.N WWW.CNN.COM* 1, 

.0 WWW.YAH00.COM* 1, 

.U WWW.USAT0DAY.COM *1, 

.W WWW.USAT0DAY.COM * 1 , WWW.IBM.COM • 1 , WWW.CNN.COM * 1 , WWW.YAH00.COM * 1 , 

.Y WWW.USAT0DAY.COM * 1 , WWW.YAH00.COM * 1 , 

AD WWW.USAT0DAY.COM* 1, 

AH WWW.YAH00.COM* 1, 

AS WWW.USAT0DAY.COM *1, 

AT WWW.USAT0DAY.COM*!, 

AY WWW.USAT0DAY.COM * 1 , WWW.YAH00.COM * 1 , 

Bl WWW.IBM.COM* 1, 

BM WWW.IBM.COM* 1, 

CN WWW.CNN.COM*!, 

CO WWW.USAT0DAY.COM * 1 , WWW.IBM.COM • 1 , WWW.CNN.COM ♦ 1 , WWW.YAH00.COM * 1 , 

DO WWW.USAT0DAY.COM *1, 

HO WWW.YAH00.COM *1, 

MO WWW.USAT0DAY.COM * 1, WWW.IBM.COM * 1. WWW.CNN.COM ♦ 1. WWW.YAH00.COM * 1, 

NN WWW.CNN.COM* I, 

00 WWW.YAH00.COM * 1 , 

OT WWW.USAT0DAY.COM* 1, 

SU WWW.USAT0DAY.COM* 1, 

WW WWW.USAT0DAY.COM * 2. WWW.IBM.COM ♦ 2, WWW.CNN.COM * 2, WWW.YAH00.COM * 2, 

FIG. 7A 



PAIR COUNT 



.C 1 

.U 1 

.W 1 

.Y 1 

AD 1 

AS 1 

AT 1 

AY 1 

CO 1 

DT 1 

MO 1 

SU 1 

WW 2 

FIG. 7B 



05/20/2004, EAST Version: 1.4.1 



1 ' 

U.S. Patent jui. 18,2000 sheet 5 of 5 6,092,100 



PAIR QQQ STRINGS 



.c 


1 WWW.USAT0DAY.COM * 


.u 


1 WWW.USAT0DAY.COM ♦ 


.w 


1 WWW.USAT0DAY.COM * 


.Y 


1 WWW.USAT0DAY.COM * 


AO 


1 WWW.USAT0DAY.COM * 


AS 


1 WWW.USAT0DAY.COM ♦ 


AT 


1 WWW.USATODAY.COM « 


AY 


1 WWW.USATODAY.COM * 


CO 


1 WWW.USAT0DAY.COM * 


DT 




MO 


1 WWW.USATODAY.COM * 


SU 


1 WWW.USATODAY.COM » 


WW 


2 WWW.USAT0DAY.COM * 



WWW.IBM.COM ♦ 1, WWW.CNN.COM « 2. WWW.YAH00.COM • 1, 

WWW.IBM.COM * 1, WWW.CNN.COM ♦ 1. WWW.YAH00.COM » 1, 
WWW.YAH00.COM* 1, 



WWW.YAH00.COM* 1. 
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WWW.IBM.COM * 2. WWW.CNN.COM * 2, WWW.YAH00.COM * 2, 
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WWW.USAT0DAY.COM = 13 * 12 = 156 
WWW.YAHO0.COM = 8*7 =56 
WWW.CNN.COM = 7*5 =35 
WWW.IBM.COM = 6*5 =30 

FIG. 7D 



WWW.USAT0DAY.COM = (0+0+0+0+0+0+0+0+0+1+0+0+0) + 9 = 10 

WWW.YAH00.COM = (0+1+0+0+1+1 + 1+0+0+1+0+0+0) +36 = 41 

WWW.IBM.COM = 0+1+0+1 + 1+1 + 1 + 1+0+1+0+1+0) +64 = 72 

WWW.CNN.COM = (1 + 1+0+1 + 1+1 + 1 + 1+0+1+0+1+0) +64 = 73 

FIG. 8 
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METHOD FOR INTELLIGENTLY SUMMARY OF THE INVENTION 

RESOLVING ENTRY OF AN INCORRECT _ . , u . f(L . , 

UNIFORM RESOURCE LOCATOR (URL) 11 15 ^ us a P nmarv ob J ect of the P resent invention to 

implement a "fuzzy" address detection scheme in a client- 

BACKGROUND OF THE INVENTION server computer network such as the Internet's World Wide 

1. Technical Field Web. 

The present invention relates generally to transactions It is another primary object of this invention to enable a 

over computer networks and more particularly to a method user of a Web client to specify a vaguely-remembered URL 

for enabling communications between a client and server in character string and let a fuzzy detection scheme automati- 

the event that a server name has been typed or otherwise jq ca u y determine the intended domain name, 

entered incorrectly by a user. , { is anothef primary object of ^ invention t0 intelli . 

2. Description of the Related Art ... gently resolve an incorrect server address (e.g., a URL) in 
The World Wide Web is the Internet's multimedia infor- t he event that a user errs in entering a target address. 

mation retrieval system. In the Web environment client {{ {s ^ anQther im rtant object of this invention t0 

machines effect transactions^, Web servers using the Hyper- 1S fesolve a ufwxyt . URL entered at a Web client to facilitate 

text Transfer Prolog communications between the Web client and a target Web 
protocol providing users access to files (e.g., text, graphics, 

images, sound, video, etc.) using a standard page description ' .,, , „ . . . 

i , „ ( ( w , r /u-r\Mi \ Still another object of this invention is to implement a 

language known as Hypertext Markup Language (HTML). S erver-based "fuzzy" URL detection scheme to facilitate 

HTML provides basic document formatting and allows the , n server - based * uzz y ^KL detection scheme to tacilitate 

, , , ■*? «i- i »* *i. a ci i *u mtelligent resolution of incorrect URLs entered at a given 

developer to specify links to other servers and files. In the * 6 

Internet paradigm, a network path to a server is identified by D ' 

a so-called Uniform Resource Locator (URL) having a A more particular object of this invention is to provide a 

special syntax for defining a network connection. Use of an URL detection scheme from a given Web server 

HTML-compatible browser (e.g., Netscape Navigator or 25 domam to mtelhgently resolve "fuzzy" file requests directed 

Microsoft Internet Explorer) at a client machine involves 10 files Wlthin the domain. 

specification of a link via the URL. In response, the client Still another aspect of this invention involves distributing 
makes a request to the server (sometimes referred to as a the fuzzy URL detection logic across multiple components 
"Web site") identified in the link and, in return, receives in in the network. Thus, a given client machine includes a 
return a document or other object formatted according to 30 fuzzy URL detection engine that tests the URL against a 
HTML. "local" history list. This is useful in the case where even the 
Typically, a user specifies a given URL manually by IP address portion of the URL (namely, the second 
typing the desired character string in an address field of the level domain name) is not recognized by the browser. If the 
browser. Existing browsers provide some assistance in this URL detection engme at the client cannot find a match, the 
regard. In particular, both Netscape Navigator (Version 3.0 35 browser may be launched to another server running the 
and higher) and Microsoft Internet Explorer (Version 3.0 ksosy URL detection engine, preferably against a more 
and higher) store URLs that have been previously accessed broad-based archive of URLs. The server may be located at 
from the browser during a given time period. Thus, when the or associated with an Internet Service Provider (ISP), or it 
user begins entering a URL, the browser performs a "type- ma y simply be another task running in the same machine or 
ahead" function while the various characters comprising the 40 environment. The fuzzy search performed at the server then 
string are being entered. Thus, for example, if the given URL returDS lo the Web chent a hst of candidate URLs for 
is "http://www.ibm.com" (and that URL is present in the selection. Further, in the situation where the entered char- 
URL list), the browser parses the initial keystrokes against acter string includes a correct IP address, the client machine 
the stored URL list and provides a visual indication to the win be connected to the server but there still may be a 
user of a "candidate" URL that the browser considers to be 45 (directory or file name) error in the URL. Thus, according to 
a "match". Thus, as the user is entering the URL he or she the invention, a target Web site also includes a fuzzy 
desires to access, the browser may "look ahead" and pull a detection engine for resolving such errors, 
candidate URL from the stored list that matches. If the Thus, in the preferred embodiment, a fuzzy detection 
candidate URL is a match, the user need not complete entry scheme is implemented across multiple components in the 
of the fully-resolved URL; rather, he or she simply actuates 50 computer network. 

the "enter" key and the browser is launched to the site. Regardless of where the detection engine is supported, the 

URL resolution through this "look ahead" approach has fuzzy search returns a list of URLs that most closely match 

provided some benefits, but the technique is unsatisfactory what was originally entered into the browser address field, 

because the target URL may not be on the saved list. The user can then select the correct URL from the list and 

Alternatively, a portion of the target URL (e.g., the second 55 launch the browser to the desired site, or to a directory or file 

level domain name) may be saved in the list but the typing within that site. Alternatively, the browser may be launched 

error may be a particular directory or file name toward the and/or otherwise connected to the particular site and/or 

end of the long string of characters. In either case, the user directory or file, as the case may be. 

is forced lo enler a long character string, only to find that the In accordance with a more particular object of the 

string cannot be meaningfully resolved (by a network nam- 60 invention, a method of detecting a URL includes a number 

ing service or a particular Web server, as the case may be). of steps initiated when a user at a Web client enters a URL. 

If the URL includes an error, a "server not found" error if the URL is not fully recognized (at the client, the 

message or the like is returned to the user. dedicated ISP server or at a Web server, as the case may be), 

It would be desirable to provide an improved technique to a fuzzy search is performed against entries of an address 

intelligently resolve an incorrect URL even if the browser 65 table that includes candidate URLs, with each URL hashed 

has never accessed the target server. This is the problem into a set of N adjacent letters (with N -2, for example). The 

addressed by the present invention. fuzzy search returns to the client a list of URLs that most 
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closely match the entered URL. The browser may be network path to a server is identified by a Uniform Resource 

launched automatically to a "best" match if the fuzzy search Locator (URL), as is well-known. A URL is a specific form 

identifies a particular candidate URL with a given level of of Uniform Resource Identifier (URI), as implemented in the 

confidence. Alternatively, or if several candidate URLs are HTTP 1.1 Specification, Internet Engineering Task Force 

returned, the user may be prompted to select a URL from the 5 (IETF) RFC xxxx, which is incorporated herein by refer- 

list, The browser is then launched to the URL. ence. Although the principles of the present invention are 

The above objects are not limited to resolving incorrect described in the context of a URL fuzzy detection scheme, 

URLs directed to HTTP-compliant Web servers. the invention may be generalized to intelligent resolution of 

Generalizing, the principles of the present invention are also an Y URI in P ut error from an Internet Protocol (IP) client 

useful in resolving incorrect Uniform Resource Identifier 10 machine. 

(URIs) specifying FTP, SMTP or other Internet Protocol A representative Web Server platform 12 comprises an 

(IP)-based servers. IBM RISC System/6000 computer 18 (a reduced instruction 

The foregoing has outlined some of the more pertinent «' of RISC-based workstation) running the AIX 

objects and features of the present invention. These objects (Advanced Interactive Executive Version 4.1 and above) 

should be construed to be merely illustrative of some of the 15 Operating System 20 and a Web server program 22, such as 

more prominent features and applications of the invention. Netscape Enterprise Server Version 2.0, that supports mter- 

Manyotherbeneficialresultscanbeattainedby applyingthe face exte f 10 ^^! e P la f tform 12 ako mcl " de f a 8 ra P mcal 

disclosed invention in a different manner or modifying the "f r "f u rface ( G ™) * for management and administrate, 

invention as will be described. Accordingly, other objects ^ c Web s f ervcr , « al f° '" cludes an Application Program- 

and a fuller understanding of the invention may be had by 20 mm f> Interface (^0 23 tbat P r ° vide f ^tensions to enable 

referring to the following Detailed Description of the Pre- ^plication developers to extend and/or customize the core 

ferred Embodiment functionality thereof through software programs commonly 

referred to as "plug-ins" or helper applications. 

BRIEF DESCRIPTION OF THE DRAWINGS A representative Web client is a personal computer that is 

25 x86-, PowerPC®- or RISC-based, that includes an operating 

For a more complete understanding of the present inven- system such as IBM® OS/2® or Microsoft Windows 95, and 

tion and the advantages thereof, reference should be made to that includes a browser, such as Netscape Navigator 3.0 (or 

the following Detailed Description taken in connection with higher), having a Java Virtual Machine (JVM) and support 

the accompanying drawings in which: f or application plug-ins and helper applications. 

FIG. 1 is a representative system in which the present 30 js^ j s well-known, the Web server normally accepts a 

invention is implemented; client request and returns a response. The operation of the 

FIG. 2 is a flowchart illustrating the conventional pro- server program 22 is governed by a number of server 

cessing associated with an HTTP request from the Web application functions (SAFs), each of which is configured to 

client to the Web server shown in FIG. 1; execute in a certain step of a sequence. This sequence, 

FIG. 3 is a block diagram of a preferred fuzzy URL 35 illustrated in FIG. 2, begins with authorization translation 

detection service implementation according to a preferred (AuthTrans) 30 during which the server translates any 

embodiment of the present invention; authorization information sent by the client into a user and 

„ i in i j * * * u a group. If necessary, the AuthTrans step may decode a 

FIG. 4 is a flowchart of a frizzy URL detection scheme {q ^ ^ ^ £ ^ ^ 

implemented in the service illustrated in FIG. 3; 40 name trans , alion (NameTrans)) the URL a^tcd with the 

FIG. 5 is a flowchart illustrating the operation the fuzzy request may be kept intact or it can be translated into a 

URL detection scheme at a Web server; system-dependent file name, a redirection URL or a mirror 

FIG. 6 is a flowchart illustrating one matching algorithm site URL, At step 34, called path checks (PathCheck), the 

for use in identifying a best match between an entered string server performs various tests on the resulting path to ensure 

and a given lexicon; 45 that the given client may retrieve the document. At step 36, 

FIG. 7A-7D illustrate a representative example of the sometimes referred to as object types (ObjectType), MIME 

operation of the matching algorithm; and (Multipurpose Internet Mail Extension) type information 

FIG. 8 illustrates how a modified version of the matching ( e -8- text/html, image/gif, etc.) for the given document is 

algorithm may be used to select a given candidate URL. identified. At step 38, called Service (Service), the Web 

50 server routine selects an internal server function to send the 

DETAILED DESCRIPTION OF THE result back to the client. This function can run the normal 

PREFERRED EMBODIMENT server service routine (to return a file), some other server 

function (such as a program to return a custom document) or 

The present invention is preferably implemented in a a CGI program. At step 39, called Add Log (AddLog), 

client-server computer network. Thus, by way of 55 information about the transaction is recorded, 

background, a representative Web client/Web server is illus- A Uniform Resource Locator (URL) has the following 

trated in FIG. 1. In particular, a client machine 10 is common syntax: 
connected to a Web server platform 12 via a communication 

channel 14. For illustrative purposes, channel 14 is the http://www.immc.com/directory/file 

Internet, an intranet, an extranet or any other known network 60 where "name" is a so-called "second level" domain name. In 

connection. Web server platform 12 is one of a plurality of the above example, the ".com" extension is merely illustra- 

servers which are accessible by clients, one of which is tive as other known or future extensions (e.g., .org, .edu, 

illustrated by machine 10. A representative client machine etc.) are or may be used. When a user enters a URL in a 

includes a browser 16, which is a known software tool used browser address field, some portion of the URL may be 

to access the servers of the network. The Web server 65 incorrect. The present invention implements a fuzzy 

platform (sometimes referred to as a "Web" site) supports URL detection scheme for "intelligent" resolution of the 

files in the form of hypertext documents and objects. The input error. Preferably, this detection scheme is distributed 
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across multiple components in the computer network 
including, without limitation, the client from which the URL 
request originates, an alternative server (e.g., located at an 
ISP), and at various target Web sites. It is not required that 
the fuzzy detection scheme be implemented at each of these 
components, however. As used herein, the URL (or other 
character string) resolution is said to be "intelligent" because 
the input errors are addressed or rectified by using a "close" 
or "best" matching strategy. A preferred "best match" strat- 
egy is described below, although any suitable matching 
algorithm may be implemented. 

FIG. 3 illustrates how the fuzzy detection scheme may be 
implemented across multiple components in the computer 
network. This "distributed" approach is desirable because, 
as noted above, a given URL error may be present in the 
domain name portion itself (namely, the IP address) or 
within the portion of the URL identifying particular direc- 
tory or file within the domain identified by the (correctly- 
entered) domain name portion. As seen in FIG. 3, at least one 
of a plurality of Web clients (each identified by reference 
numeral 40) is connectable to a plurality of Web servers 
42a-42/i, each of which support a plurality of related files. 
The client includes a fuzzy URL detection engine 44 for 
carrying out the method of the present invention. The 
computer network also includes one or more dedicated 
servers 46-46/1, each of which may supports its own fuzzy 
detection engine 44 as well. Thus, for example, dedicated 
server 46a is associated with Internet Service Provider (ISP) 
48. As used herein, however, a so-called "dedicated" server 
need not be physically remote from any other device or 
component. Thus, for example, a given "server" having this 
function may simply be a new task running in an existing 
machine. A management server 50 may be used to manage 
the servers 46a-46/i, and thus customers may subscribe to a 
fuzzy detection "service." 

In addition to detection engine support at the client and at 
the server 46, it is preferred that one or more of the Web 
servers also include the detection engine 44. As is well- 
known in the art, a set of related files may be supported on 
a Web server as a so-called "Web site". Supporting a 
detection engine at a Web site is desirable because, as will 
be seen, a user of a Web client may enter the domain name 
correctly (in which case the connection between client and 
server may be established) but still have erred in entering 
some other portion of the URL (e.g., a directory or file 
name). Thus, if the client machine is successful in making 
some contact with a Web site (in other words, the IP address 
was entered correctly), the Web site fuzzy detection engine 
is used. 

FIG. 4 is a flowchart of a preferred implementation of the 
distributed fuzzy URL detection scheme. The routine begins 
at step 51 when the user enters a character string, preferably 
in the address field of the Web client browser. At step 52, a 
test is done to determine whether the string is recognized. If 
so, the routine continues at step 54 with the client connected 
to the target server in the usual manner. If, however, the 
outcome of the test at step 52 indicates that the character 
string is not recognized, fuzzy detection is required. 
Preferably, analysis begins with the local URL detection 
engine. 

In particular, a test is done at step 55 to determine whether 
the IP address portion of the URL is recognized and a 
connection established. As noted above, the IP address is the 
second level domain name. If the outcome of the test at step 
55 indicates that the IP address has been recognized, then the 
client machine has made some at least some contact with a 
target server. The routine continues processing this branch as 
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described in the flowchart of FIG. 5. If, however, the routine 
indicates that the IP address has not been recognized, 
processing continues at step 56. At this step, the character 
string is indexed into a lexicon of server IP names that have 

5 been used by the Web client over a given "history" period. 
The history period may be user-selectable (using a Prefer- 
ences menu or the like) or the engine may be a default to a 
given setting (e.g., URLs accessed from the client over the 
last 30 days). A test is then done at step 58 to determine 

10 whether the character string "matches" against any entry in 
the lexicon with respect to a given confidence level. The 
confidence level may be user-selectable or a default value. If 
the outcome of the test at step 58 indicates that the character 
string "matches" a URL entry in the lexicon by the prede- 

15 termined threshold, the routine continues at step 60 to launch 
the browser to the URL. 

If, however, the outcome of the test at step 58 indicates 
that a "match" docs not exist (based on the given threshold), 
then the routine continues at step 62 with an evaluation of 

20 the rankings. At this point, a second test is preferably 
performed using a lower selectable or default threshold. 
Thus, for example, the best match algorithm may return one 
or more candidate URL choices that do not meet the 
threshold (for automatically launching the browser) but 

25 (given their ranking value) still are "close" to the character 
string entered. In such case, it may be desirable to display a 
subset of candidate URLs that may still include the intended 
URL. Thus, at step 63, a test is done to determine whether 
any candidate URLs returned from the fuzzy search satisfy 

30 a lower threshold value (which also may be preset or set to 
a default). If the outcome of the test at step 63 is positive, the 
routine continues at step 65 to display (in a listbox, for 
example) those candidate URLs that are ranked over the 
lower threshold (but below the upper threshold). At step 67, 

35 the user is prompted to select one of the displayed URLs. At 
step 69, a test is performed to determine whether the user has 
made a selection within a given timeout. If not, an error 
message is displayed at step 71. Otherwise, the browser is 
launched to the selected URL at step 73. 

40 If the outcome of the test at step 63 is negative, then the 
URL must be found elsewhere. In this case, the browser is 
launched to one of the dedicated servers 46, as discussed 
above. In the preferred embodiment, a given server 46 
supports a much larger database of most-recently used 

45 (MRU) URLs against which the fuzzy search may be run. 
A test is run at step 64 to determine if the browser has 
established a connection to the dedicated server 46. If not, 
the routine continues at step 66 and returns an error message. 
If, however, the browser has established a connection to the 

50 dedicated server, the routine continues at step 68 by running 
a fuzzy search against the server database. A test is then done 
at step 70 to determine whether a "match" is found within 
some degree of confidence. If the outcome of the test at step 
70 is positive, the browser is automatically launched to the 

55 identified URL at step 72. This may be accomplished by 
having the dedicated server 46 issue a HTTP 301 redirection 
request to the browser, using the identified URL. If, 
however, the outcome of the test at step 70 is negative 
(because, for example, a match is found but not with a high 

60 enough degree of confidence), the routine continues at step 
74 to return to the client a list of candidate URLs. At step 76, 
the candidate URLs are displayed by the browser (using 
local display resources, such as a listbox or other menu). The 
user is then prompted at step 78 to select one of the URLs 

65 listed. At step 80, a test is performed to determine whether 
the user has made a selection within a given timeout. If not, 
an error message is displayed at step 82. Otherwise, the 
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browser is launched to the selected URL at step 84. This URLs. In particular, step 118 preferably involves multiply- 

completes the processing. ing the number of occurrences of the candidate URL times 

FIG. 5 illustrates the operation of the fuzzy detection logic the number of pairs in which the candidate URL appears, 

at the Web server. As noted above, this routine is reached Thus, for example, www.usatoday.com occurs thirteen (13) 

where the IP address is correctly entered at the Web client 5 times overall and is present in twelve (12) of the pairs 

but some latter portion of that URL is incorrect or cannot be comprising the table in FIG. 7C. Thus, www.usatoday.com 

resolved at the target server. To this end, the routine begins receives a ranking of 156. The next best candidate URL, 

at step 86 by removing all or part of the IP address portion www.yahoo.com, occurs eight (8) times overall and is 

(which has been correctly recognized). The routine then present in seven (7) of the pairs, for a ranking of 56. FIG. 7D 

continues at step 88. In particular, the remaining portion of 10 illustrates the ranking table, with all four (4) candidate URLs 

the character string is indexed into a lexicon of directory/file identified. The candidate URL with the "highest" score is 

names available in the server. This test may be run against then selected as the match. This completes one type of best 

a portion of the available names (e.g., those that have been match algorithm that may be used. 

most recently used (MRU), or those that have been MRU to Thus, considering this example, the fuzzy URL search 

respond to an HTTP request from this particular client, and is would automatically launch the browser to www.usatoday- 

so on). This is the fuzzy search. The particular options may .com (if the 156 ranking satisfies the threshold set in step 58 

be selectable. A test is then done at step 90 to determine in FIG. 4). Alternatively, the candidate URL rankings are 

whether the character string "matches" against any entry in displayed to facilitate user selection. Presumably, the user 

the lexicon with respect to a given confidence level. The would then select www.usatoday.com given that this URL 

confidence level may be selectable or a default value. If the 20 has the highest ranking. 

outcome of the test at step 90 indicates that the character An alternative best match algorithm, which is preferred, 

string portion "matches" a directory/file entry in the lexicon uses the same three (3) tables illustrated in FIGS. 7A-7C but 

by the predetermined threshold, the routine continues at step processes the scores in a manner to force the selection of a 

92 to return the requested document to the client. URL with a given "low" ranking. In this example, the strings 

If, however, the outcome of the test at step 90 indicates 25 are "scored" by squaring the differences between the QQQ 

that a "match" does not exist (based, for example, on the values (those given by the incorrectly typed string) in FIG. 

given threshold), then the routine continues at step 94 to 7C and summing the squares together. Preferably, the sums 

return to the client a list of candidate URLs (having the are then increased by the square of the number of pairs lists 

fully-extended directory/file "candidates". At step 96, the from which the two strings are both absent. These values are 

candidate URLs are displayed by the browser (using local 30 then ranked lowest to highest. For example, www. usato- 

display resources, such as a listbox or other menu). The user day.com matches QQQ for all pairs except pair "dt" so the 

is then prompted at step 98 to select one of the URLs listed. difference is zero for all of the pairs items except "dt", where 

At step 100, a test is performed to determine whether the it is "1" (1-0), which, when squared, still yields 1. Because 

user has made a selection within a given timeout, If not, an www.usatoday.com is in the "do" and "ot" pairs lists, but the 

error message is displayed at step 102. Otherwise, the 35 string in question is not, and because the string in question 

browser is launched to the selected URL at step 104. This is in pair list "dt" and www.usatoday.com is not, the sum of 

completes the processing. the squares is augmented by (2+1) squared, or 9. Similar 

One simple matching algorithm is a "best match" tech- computations are performed for the other candidate URLs, 

nique illustrated in the flowchart of FIG. 6. For purposes of resulting in the table illustrated in FIG. 8. The "suggestion" 

discussion, this technique is described with respect to the 40 for correction would then be www.usatoday.com, which 

client-based URL detection scheme, although a similar (or (with the lowest value) is at the top of the list. Other 

alternative) methodology is implemented in the server 46 possibilities could be proposed to the user if their scores 

and/or Web server-based versions. In this example, the were close to the "low" value (in this illustrative example), 

history list (namely, the candidate URLs) include the fol- When the string in question is short and possibly many 

lowing: www.usatoday.com, www.ibm.com, www.cnn.com 45 close hits are found, the algorithm can be run with ordered 

and www.yahoo.com. The routine begins at step 110 to pairs instead of unordered pairs. Further, the similar entries 

generate the lexicon against which the fuzzy URL search is can be sorted by other factors such as popularity of the given 

performed. This step is generally performed "off-line" auto- string. Also, if the database becomes very large, it is useful 

matically (and not necessarily when the character string is to use "triples" of characters (unordered and ordered) and so 

entered). At step 110, each candidate URL is hashed into a 50 forth, as needed, to ensure that the scheme remains robust, 

set of N adjacent characters (irrespective of ordering) as Typically, the "pair" approach described above is sufficient 

illustrated, by way of example only, in FIG. 7A. In this at the client, although the 3-tuple or "higher" approach may 

example, each URL has been hashed or decomposed into its be more useful at the server (where there is a much larger 

constituent unordered "pairs" with the pairs then ordered database of entries). Of course, other suitable best match 

alphabetically. Thus, for example, the pair ".c" appears once 55 algorithms may be used. 

in www.usatoday.com, once in www.ibm.com, twice in The present invention provides numerous advantages. 

www.cnn.com, and once in www.yahoo.com. Each entry of Existing Web client-based "look ahead" approaches cannot 

the lexicon is preferably organized in this manner. recognize URLs that arc not "close" to URLs previously 

At step 112, the user enters the target URL. For purposes visited. Because the Web constantly changes, the use of such 

of illustration, it is assumed that the user desires to navigate 60 "local" lists to resolve misspelled URLs does not provide 

to www.usatoday.com but mistakenly types www.usatday- suitable results. Using the present invention, a user can 

.com. The routine then continues at step 114 to hash the specify a vaguely-remembered character string and let the 

input URL into a hash table as illustrated in FIG. 7B. The system automatically determine the intended domain name, 

routine then continues at step 116 to intersect (through a The method and system thus resolve an incorrect character 

logical AND operation) the lexicon (in FIG. 7A) with the 65 string into an electronic address known to the computer 

hash table (in FIG. 7B). The result of step 116 is illustrated network. Preferably, if a URL is entered at a Web client 

in FIG. 7C. At step 118, the routine then ranks the candidate incorrectly, the system automatically performs a fuzzy 
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search that returns a list of URLs that most closely match 
what was originally entered into the browser address field. 
The user can then select the correct URL from the list and 
launch the browser to the desired site or file within that site. 
The disclosed method and system thus recognizes user- 5 
specified server names that are misspelled and provides a 
mechanism for connecting the client to the server irrespec- 
tive of that error. 

The above -described functionality is preferably imple- 
mented across the client, a given target server, or in a server 10 
supported at or associated with an Internet Service Provider 
(ISP) or other computer network device or system. 
Generalizing, the software is simply a computer program 
product implemented in a computer-readable medium or 
otherwise downloaded to the client, server or other device is 
over the computer network. With respect to the client piece, 
the functionality may be built into the browser, or is imple- 
mented as a browser plug-in or helper application. 
Alternatively, the functionality may be implemented as a 
Java applet or application. 20 

The "fuzzy" URL scheme, of course, may be generalized 
for any Uniform Resource Identifier ("URI"), of which the 
URL is a special case. Thus, the present invention may be 
used in other Internet services including, without limitation, 
file transfer (using the file transfer protocol (FTP)), point- 25 
to-point messaging or e-mail (using the simple message 
transport protocol (SMTP), and the like. 

In the preferred embodiment, "entry" of a URL is typi- 
cally accomplished using a keyboard associated with the 
client machine. This is not a limitation of the present 30 
invention, however. The particular manner by which the 
incorrect URL is entered is not a limitation of the invention. 
Thus, for example, a URL may be entered by other than 
keyboard entry (e.g., voice commands or by a suitable 
speech recognizer), 35 

One of the preferred implementations of invention is thus 
as a set of instructions (program code) in a code module 
resident in the random access memory of the computer. Until 
required by the computer, the set of instructions may be 
stored in another computer memory, for example, in a hard 40 
disk drive, or in a removable memory such as an optical disk 
(for eventual use in a CD ROM) or floppy disk (for eventual 
use in a floppy disk drive), or downloaded via the Internet 
or other computer network. 

In addition, although the various methods described are 45 
conveniently implemented in a general purpose computer 
selectively activated or reconfigured by software, one of 
ordinary skill in the art would also recognize that such 
methods may be carried out in hardware, in firmware, or in 
more specialized apparatus constructed to perform the 50 
required method steps. 

Further, as used herein, "Web client" should be broadly 
construed to mean any computer or component thereof 
directly or indirectly connected or connectable in any known 
or later-developed manner to a computer network, such as 55 
the Internet. The term "Web server*' should also be broadly 
construed to mean a computer, computer platform, an 
adjunct to a computer or platform, or any component 
thereof. Of course, a "client" should be broadly construed to 
mean one who requests or gets the file, and "server" is the 60 
entity which downloads the file. As previously discussed, the 
features of the invention may be implemented in any IP 
client, and not just a HTTP-compliant Web client running a 
Web browser. 

Having thus described our invention, what we claim as 65 
new and desire to secure by Letters Patent is set forth in the 
following claims. 



What is claimed is: 

1. A method of intelligently resolving an unrecognized 
character string entered at an IP client connectable to a 
plurality of IP servers in a computer network, each of the IP 
servers having an IP address, comprising the steps of: 

responsive to entry of the unrecognized character string at 
the IP client, performing a fuzzy search of the unrec- 
ognized character string against a lexicon of the IP 
server addresses; 

returning a list of IP server addresses that most closely 
match the unrecognized character string; and 

connecting the IP client to an IP server identified by one 
of the IP server addresses in the list. 

2. The method as described in claim 1 wherein the 
computer network is the Internet, the IP client is an HTTP- 
complaint Web client having a Web browser, and at least one 
IP server address is associated with a Uniform Resource 
Locator (URL). 

3. The method as described in claim 1 wherein the lexicon 
includes a table of entries, each entry comprising a set of one 
or more of the IP addresses having a given character pair. 

4. The method as described in claim 3 wherein the frizzy 
search logically ANDs the table of entries with a hashed 
version of the unrecognized character string. 

5. The method as described in claim 1 further including 
the steps of: 

displaying the list of IP server addresses to a user of the 
IP client; and 

prompting the user of the IP client to select a given one 
of the IP server addresses in the list, wherein the 
selected IP server address is used in the connecting 
step. 

6. The method as described in claim 1 wherein the fuzzy 
search is performed at the client. 

7. The method as described in claim 1 wherein the fuzzy 
search is performed at a server in the computer network. 

8. The method as described in claim 1 wherein the IP 
server address is associated with a Uniform Resource Iden- 
tifier (URI). 

9. A method of intelligently resolving a character string 
entered at a Web client connectable to a plurab'ty of Web 
servers in a computer network, each of the Web servers 
associated with a Uniform Resource Locator (URL), com- 
prising the steps of: 

responsive to entry of a character string at the Web client, 
performing a fuzzy search of the character string 
against a lexicon of the Web server URLs; 

returning a list of URLs that most closely match the 
character string; 

displaying the list of URLs to a user of the Web client; and 

prompting a user of the Web client to select a given one 
of the URLs in the list. 

10. The method as described in claim 9 further including 
the step of: 

connecting the Web client to Web server identified by the 
selected URL. 

11. The method as described in claim 9 wherein the 
lexicon includes a table of entries, each entry comprising a 
set of one or more of the URLs having a given character pair. 

12. The method as described in claim 11 wherein the 
fuzzy search logically ANDs the table of entries with a 
hashed version of the character string. 

13. The method as described in claim 9 wherein the 
character string is entered in a browser address field of the 
Web client. 
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server URLs stored at the Web client; 



10 



15 



14. A method of resolving a character string entered at a 
Web client connectable to a plurality of Web servers in a 
computer network, the Web client having a browser, and 
each of the Web servers associated with a Uniform Resource 
Locator (URL), comprising the steps of: 5 

responsive to entry of a character string, performing a 
frizzy search of the character string against a first subset 
of Web server URLs stored at the Web client; 

if the character string meets a given criteria, launching the 
browser to a given Web server; and 

if the character string does not meet the given criteria, 
taking a predetermined action to resolve the character 
string into a URL. 

15. The method as described in claim 14 wherein the 
predetermined action includes the steps of: 

displaying to a user a list of candidate URLs; 
prompting the user to select a URL from the list; and 
launching the browser to a Web server identified by the 
selected URL. 20 

16. The method as described in claim 14 wherein the 
predetermined action includes the steps of: 

launching the browser to a server located in the computer 
network; and 

performing a fuzzy search of the character string against 
a second subset of Web server URLs stored at the 
server. 

17. The method as described in claim 16 further including 
the steps of: 30 

returning a list of candidate URLs from the server to the 
client; 

displaying the list of candidate URLs; 
prompting the user to select a URL from the list; and 
launching the browser to a Web server identified by the 
selected URL. 

18. A method of resolving a character string entered at a 
Web client connectable to a plurality of Web servers in a 
computer network, the Web client having a browser, and 
each of the Web servers associated with a Uniform Resource 
Locator (URL), comprising the steps of: 

responsive to entry of a character string, performing a 
fuzzy search of the character string against a set of Web 



35 



45 



if the character string includes a given domain name 
portion that is recognized as being part of one of the 
Web server URLs stored at the Web client, launching 
the browser to the Web server identified by the given 
domain name; and 50 

taking a predetermined action at the Web server to fully 
resolve the character string. 

19. The method as described in claim 18 wherein the 
predetermined action includes the steps of: 



performing a fuzzy search of a portion of the character 
string against a set of files stored at the Web server; 

returning to the client a file that most closely matches 
against the portion of the character string. 

20. A computer program product in a computer readable 
medium for use in a Web client for resolving a character 
string entered at a Web browser running on the Web client, 
comprising: 

means responsive to entry of a character string at the Web 
client for performing a fuzzy search of the character 
string against a lexicon of Web server URLs stored at 
the Web client; 

means responsive to the fuzzy search for returning a list 
of URLs that most closely match the character string; 
and 

means responsive to display of the list of URLs for 
prompting a user of the Web client to select a given one 
of the URLs in the list. 

21. The computer program product as described in claim 

20 wherein the lexicon includes a table of entries, each entry 
comprising a set of one or more of the URLs having a given 
character pair. 

22. The computer program product as described in claim 

21 wherein the fuzzy search logically ANDs the table of 
entries with a hashed version of the character string to 
generate the list of URLs. 

23. A computer program product in a computer readable 
medium for use in a Web client for resolving a character 
string entered at a Web browser running on the Web client, 
comprising: 

means responsive to entry of a character string at the Web 
client for determining whether the character string 
includes a domain name portion recognized as being 
part of one of the Web server URLs stored at the Web 
client; and 

means responsive to the determining means for launching 
the browser to the Web server identified by the given 
domain name to facilitate a fuzzy search of a remaining 
portion of the character string. 

24. A computer program product in a computer readable 
medium for use in a Web server for resolving at least a 
portion of a character string entered at a Web browser 
running on a Web client connectable to the Web server, 
comprising: 

means for performing a fuzzy search of the character 
string portion against a lexicon of files stored at the 
Web server; and 

means responsive to the fuzzy search for returning a file 
that most closely matches the character string portion. 
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